Initial Sampling for Automatic Interactive Data Exploration
نویسندگان
چکیده
In many real world applications, users might not know the queries to send to a database in order to retrieve data in the user-interested areas. Users can apply a trial and error method to discover the queries. However, as the data set is usually quite large, the discovery of queries will take a long time and the whole process is labor-intensive. We want to build a discovery-oriented, interactive data exploration system, that guides users to their interested data areas through interactive sample labeling process. In each iteration, the system will strategically select some sample points to present to users for feedback, as relevant or irrelevant, and finally converge to a query that is able to retrieve all the data in the user-interested area.
منابع مشابه
A Framework for Knowledge-based Interactive Data Exploration
In this paper, we propose a framework that combines the functionality of data exploration and automatic presentation systems to create a knowledge-based, interactive, data exploration system. The purpose of a data exploration system is to enable users to uncover and extract relationships hidden in large data sets. The purpose of an automatic presentation system is to reduce the need for users a...
متن کاملVisibility-difference entropy for automatic transfer function generation
Direct volume rendering allows for interactive exploration of volumetric data and has become an important tool in many visualization domains. But the insight and information that can be obtained are dependent on the transfer function defining the transparency of voxels. Constructing good transfer functions is one of the most time consuming and cumbersome tasks in volume visualization. We presen...
متن کاملFramework for Interactive Million-Neuron Simulation Running title: Interactive Million Neuron Simulation
Large simulations have become increasingly complex in many fields, tending to incorporate scaledependent modeling and algorithms and wide-ranging physical influences. This scale of simulation sophistication has not yet been matched in neuroscience. In this paper we describe a framework aimed at enabling natural interaction with complex simulations: their configuration, initial conditions, monit...
متن کاملCombining User Interaction, Speculative Query Execution and Sampling in the DICE System
The interactive exploration of data cubes has become a popular application, especially over large datasets. In this paper, we present DICE, a combination of a novel frontend query interface and distributed aggregation backend that enables interactive cube exploration. DICE provides a convenient, practical alternative to the typical offline cube materialization strategy by allowing the user to e...
متن کاملAIDE: An Automated Sample-based Approach for Interactive Data Exploration
In this paper, we argue that database systems be augmented with an automated data exploration service that methodically steers users through the data in a meaningful way. Such an automated system is crucial for deriving insights from complex datasets found in many big data applications such as scientific and healthcare applications as well as for reducing the human effort of data exploration. T...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2016